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Atty. Docket No. 600189-097 
CLAIM AMENDMENTS AND LISTING OF CLAIMS 

The listing of claims will replace all prior versions, and listings, of claims 

in the application. 

1 . (Currently Amended) A computerized method for estimating coverage 
of search engines, each search engine maintaining an index of words of pages located at 
specific address in a network, comprising the steps of: 

generating a random query, the random query being a logical combination 
of words found in a training set of the pages; 

submitting the random query to a first search engine; 

receiving a set of URLs in response to the random search query; 

randomly selecting a particular URL identifying a sample page; 

generating a strong query for the sample page; 

submitting the strong query to a second search engine; and 

comparing result information received in response to the strong query to 
determine if the second search engine has indexed the sample page r; and 

generating an estimate of the relative sizes and amount of overlap of the 
indices of the first and second search engines. 

2. (Previously Presented) The method of claim 1 comprising estimating 
relative sizes of the indices of the first and second search engines by dividing a fraction 
of a first set of pages sampled from the second search engine that are contained in the 
first search engine by a fraction of a second set of pages sampled from the first search 
engine that are contained in the second search engine. 
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3. (Previously Presented) The method of claim 2 comprising estimating a 

relative amount of overlap of the indices of the first and second search engines by 
computing a fraction of a set of pages sampled from the second search engine that are 
combined in the first search engine. 

4. (Previously Presented) The method of claim 1 wherein the training set 
of pages relates to a particular context domain. 

5. (Previously Presented) The method of claim 1 wherein the random 
query combines random words selected from the training set with a logical operator. 

6. (Previously Presented) The method of claim 1 wherein the random 
query is a disjunctive query. 

7. (Previously Presented) The method of claim 6 wherein the disjunctive 
query combines a set of words using OR operators, the set of words having a 
predetermined size. 

8. (Previously Presented) The method of claim 7 wherein the words of the 
training set have relative frequencies that are substantially similar. 

9. (Previously Presented) The method of claim 1 wherein the random 
query is a conjunctive query combining a pair of words and an AND operator. 

10. (Previously Presented) The method of claim 9 comprising: 

sorting the words in the training set according to frequencies of the words; 

and 

establishing an upper frequency threshold and a lower frequency threshold 
so that when words equidistant from the upper and lower thresholds are combined in the 
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conjunctive query, a set of addresses is less than or equal to a predetermined maximum 

number of members. 

1 1 . (Previously Presented) The method of claim 1 wherein the network is 
the World Wide Web and comprising: 

fetching the particular URL from the first search engine; 
fetching a corresponding page from the World Wide Web; and 
constructing the strong query to be representative of the sample page. 

12. (Previously Presented) The method of claim 1 wherein the result 
information includes URLs of pages indexed by the second search engine. 

13. (Previously Presented) The method of claim 12 wherein the URLs of 
the pages indexed and addresses identifying the sample pages are normalized before the 
comparing. 

14. (Previously Presented) The method of claim 12 wherein the result 
information being compared is content of the sample page, and content of the pages 
indexed by the second search engine. 

15. (Previously Presented) The method of claim 12 wherein the result 
information includes host names. 

16. (Previously Presented) The method of claim 1 comprising discarding 
dynamic and outdated pages before comparing. 

17. (Previously Presented) The method of claim 1 wherein privileged 
access is provided to the first search engine. 
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